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Introduction 

(1) 
Rollett and Wilkinson % ' have discussed a rearrangement of 

(2) 
the order of calculation of the Givens % ' method for the reduction of 

a symmetric matrix to a codiagonal form vhich reduces the number of 

times that the matrix must be read from a backing store. In their 

paper they remark that the Householder method (Wilkinson w; ) appears to 

require twice as many scans of the data, and consequently, for slov 

backing stores, is slower than Givens method, although taking 1/2 the 

number of multiplications. This note suggests an organization of the 

computations of the Householder method ^which maintains the arithmetic 

advantage and only requires the same number of scans of the data as the 

Roilett-Wliklnson method. 

Simply stated, the methods consists of calculating during one 
scan only the two vectors necessary to make the transformation but 
delaying making the transformation until the matrix is read out for the 
calculation of the vectors during the next pass. 



I. Hfethcd 

The Householder method involves finding a unit vector 

m 
v = (x^, x 2 $ — , X Q ) 

such that the orthogonal matrix 

T 'S 

]r = p = i-2wv_ 

can be used to transforiB the matrix A so that PAP has zeros in the 
(1,3)1 (X*0; (l*55l — said (l,n) positions. w_ can be chosen so that 
x, = 0. !She process is then repeated on the sub-matrix obtained by 
deleting the first row and column of PAP. The final result after 

a - 2 applications is a codiagonal matrix. 
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The following notation is used: 

Loner case letters are data and intermediate results. In 
particular j, a. , are the elements of A* x. are the elements 
of w, and p ± and g i are the elements of the Wilkinson^" 5 ' 
vectors p_ and g. 

Upper case letters refer to storage locations in the main 

random access memory. Five hanks of storage of n locations 

each are needed. They are A J[ , B^ C^, P ± and <k ±t A ± is 

used to hold the last row of the matrix read from the hacking 

store. Bj is used to put the x in after these are calculated. 

C A holds the x ± from the last scan. P is used to accumulate 

the elements of p_ as thay are formed. [These should be more 

than single precision since they accumulate an inner product.] 

Q. hold the q. calculated on the last scan. Initially C. and Q. 
. i i 

must he cleared to zero. (Or better still, terms involving 

them not calculated on the first scan.) 

The calculation steps are given in terms of the storage locationsj 

the contents of the locations are mentioned to aid the reader folloving 

(3) 

the Wllkinaon w ' paper. The matrix A is assimed to he in backing store, 

stored by row in its upper triangular form. The result of one pass is to 
transform the matrix, reducing the last n - 2 eleaents of its first row 
to zero, and to use the storage locations vacated to save the vector w 
for future work, n - 2 applications of the process to successive 
sub-matrices reduce the matrix to the required form. 

The arithmetic and data moving for the r pass is as follows: 

1) 

•- n 

2) A^ - A^ - QjC r - Q^Cj ; J - r, r+1, - n 
(This step need not be performed when r - 1.) 

3) ■-♦<TMS fl + < ie *-- + jJ) 
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Head the 


th 
r row: 


x i 


"^J 


* 
* 


i = r, 


A i 


-v 




"Vj 



5) B k = \ = Q \ * k - r+2, »3, - 



sign (A ) 
where Q • -— — fZ±_ 
2B r+l S 



6) B p = -S sign (a^) (This is the new A r ^ element) 

7) P r " *r " B r + 1 3 **** C^) 

8) Restore the r rov of the matrix using the contents of 
V B r+1> B r+2» — ' B a 

(The actual value of the r th row ie A . B , 0, 0. — o 
but the codiagonal is being saved in locations B „ B , — 
5^ as it is calculated so that there are r - 1 locations 
spare in the r rov to store the v vector.) 

9) Set P = p _ = p rt 3 p 

x* r+l r*S n * 

10} A. Read the k rov of the matrix into A 

i,e * A J =a k5 ' J " k ' k+1 > ~» 

B. Calculate A» - A^ - Q^ - Q^ , J=k, -- n 

'l'Ais BX/tJp aau "uues next «e*u ue uwuuuoouu'waen r = x. 

C. Store A. , -- A hack as the nev k row 
i.e. a^ . A j | J = k, — n 

*1 

P^ = P^ + BjA^ | J * fcfl, — n 

These products should be accumulated double precision. 
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E. Do steps 10A to 10D for k = r*l„ r&2 9 — n 



tt l 



11. T =.5^ P.B. (This is the Wilkinson K = £ A w) 



12. Qj = 2(Pj - TB ) | j = 3N-1, — n 



(These are double the elements q. which are used to form 
the transfonoedoatric.) 

(EAP)^ - (A - 8 v £ - 2 a £) i<3 - .„ - B A - B^) 

13. Save the B for the next step in storage hank C, 
C« » B. | ,3 = r+1, — n 

When passes r = 1^ r = 2,, — r « n - 2 have been completed,? 

the last two rovSj, namely elements a^ . . ., a and a must he 

n-l,n-l n-l; n nyn 

updated, The latter is in A so can be treated Immediately with the 
formulas 

A' - A - 2 QC 
n n sa 

th 

This element can he read hack and then the (n-l) row can 

he read and treated appropriately. If after this has been done, A q (now 

containing a -i is stored in B j, **» co-diagonal is in locations 

B . B , — . B , and the diagonal is in the hacking store in its original 

1* 2* ' n-1 
position, The locations originally holding the off diagonal elements now 

contain the various w used in the transformation. 

H. Co mparison with other methods . 

■a 

This process takes approximately 2n /3 multiplications, there 
are no additional multiplications over the usual Householder method due to 
this organization of calculation. This compares to approximately *m /3 
multiplications in the symmetric Given' s method. The number of backing 
store references is identical to the Rollett-Wilkinson version of Given' s 
method or to methods taking advantages of special characteristics of the 
backing store to be able to read rows or columns at full speed. 



fhls method uses 6n storage locations holding n double precision 
and 4n single precision numbers in the main store^ against kn la the 
Rollett-WilkLason method. For unsymffietrie matrices, a staple extension 
of this technique uses approximately 5* -/3 multipli cations s against 

■3 

approximately 10*1^/3 for Givens., hut it now requires 7n storage locations 
in the main memory since it is now necessary to calculate both p - A w 
and v = w A as A is unsyametricj although only one requires a double 
precision hank of storage^ since now a full row of A is read each time 
so that each element of £ can he calculated double precision and then 
rounded to single precision before being stored. 
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